Language Universals: Cross-lingual Comparison of Topic Dependent Adjectives
Similar resources
Cross-lingual Text Classification Using Topic-Dependent Word Probabilities
Cross-lingual text classification is a major challenge in natural language processing, since often training data is available in only one language (target language), but not available for the language of the document we want to classify (source language). Here, we propose a method that only requires a bilingual dictionary to bridge the language gap. Our proposed probabilistic model allows us to...
Cross-Lingual Latent Topic Extraction
Probabilistic latent topic models have recently enjoyed much success in extracting and analyzing latent topics in text in an unsupervised way. One common deficiency of existing topic models, though, is that they would not work well for extracting cross-lingual latent topics simply because words in different languages generally do not co-occur with each other. In this paper, we propose a way to ...
Multilingual and cross-lingual news topic tracking
We are presenting a working system for automated news analysis that ingests an average total of 7600 news articles per day in five languages. For each language, the system detects the major news stories of the day using a group-average unsupervised agglomerative clustering process. It also tracks, for each cluster, related groups of articles published over the previous seven days, using a cosin...
Adaptive Topic-Dependent Language
This paper presents two extensions of the standard interpolated word trigram and cache model, namely the extension of the trigram model by useful word m-grams with m > 3, resulting in a varigram model, and the addition of topic-specific trigram models. We give the criteria for selecting useful m-grams and for partitioning the training corpus into topic-specific subcorpora. We apply both extens...
Language-Dependent and Language-Independent Approaches to Cross-Lingual Text Retrieval
We investigate the effectiveness of language-dependent approaches to document retrieval, such as stemming and decompounding, and contrast them with language-independent approaches, such as character n-gramming. In order to reap the benefits of more than one type of approach, we also consider the effectiveness of the combination of both types of approaches. We focus on document retrieval in ni...
Journal
Journal title: Journal of Universal Language
Year: 2004
ISSN: 1598-6381
DOI: 10.22425/jul.2004.5.1.21